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Chapter  One 
Introduction 


!n  order  for  a  mobile  robot  to  maneuver  through  its  environment  and  execute  any  sort 
of  reasonably  intelligent  task,  it  should  first  be  able  to  perceive.  That  is.  it  should  be  able  to 
build  a  mode!  of  its  world  based  on  sensory  information.  An  alternative  approach  might  be 
to  assume  a  near  perfect  model  of  the  world,  and  perform  task  planning  from  that,  as  is 
'  often  done  in  CAD  based  manipulator  robotics  {Lozano- Perez  81.  Brooks  84).  However,  in 
contrast  to  stationary  arm  robots  which  are  fixed  to  a  global  coordinate  frame,  a  mobiL 
robot's  world  is  essentially  unknowable  due  to  cumulative  errors,  and  sensing  must  be  done. 

Various  types  of  sensors  have  been  used  in  the  past,  such  as  bumper  switches,  shaft 
encoders,  sonar  transducers  [Chatiergy  85],  photocells  and  infrared  proximity  sensors 
[F.verctt  82a],  cameras  [Nilsson  69a.  Moravec  81a],  infrared  beacons  [Giralt  33]  and  laser 
rangefinders  [Thompson  79],  Each  type  of  sensor,  however,  has  some  limitation.  Shaft 
encoders  aren't  accurate  when  wheels  slip,  for  instance.  Sonar  sensors  have  a  wide 
bcamwidth  and  are  sensitive  to  specular  surfaces  and  cameras  require  computationally 
intensive  processing.  One  solution,  that  which  is  followed  here,  is  to  use  redundant  sensors 
and  utilize  the  ad'  antageous  characteristics  of  some  in  overcoming  the  disadvantages  of 
others. 

This  thesic  describes  how  two  very  inexpensive  sensors,  a  sonar  rangefinder  and  a 
novel  infraied  sensor,  are  coupled  to  produce  data  that  is  better  for  building  a 
representation  of  the  robot's  environment  than  using  the  sensors  individually.  The  sonar 
rangefinder  measures  the  distance  to  an  object  but  has  poor  angular  resolution  due  to  its 
wide  bcamwidth.  In  contrast,  the  infrared  sensor,  though  not  able  to  measure  distance 
accurately,  has  good  angular  resolution  in  detecting  the  absence  or  presence  of  an  object 
By  using  both  sensors  to  scan  a  room,  the  robot  is  able  to  build  a  better  map. 


I  he  in  (hired  sensor  is  .ihle  lo  find  edges  of  doorways  and  narrow  passages  that  would 
he  otherwise  blurred  by  the  sonars  I  he  boundary  of  data  points  which  is  initially  created 
hy  the  suiiar  readings  is  redrawn  appropriately  to  mark  the  doors  detected  by  the  infrared. 
I  he  curs  at u re  primal  sketch  [Brady  84]  is  then  extracted' from  this  modified  boundary,  and 
significant  curvature  points  are  used  ;ls  landmarks  for  matching  between  scans  as  the  robot 
niuxes.  Scans  from  subsequent  moves  can  be  merged  and  a  room  map  boundary  is  created. 
I  his  is  then  transformed  into  a  list  of  polygons  in  order  to  provide  the  necessary  input  for 
path  planners  based  on  generalized  cones  (Brooks  83]  or  configuration  space  (Brooks  85], 

Ibe  robot  used  in  this  work  was  Robart  It  [Kverett  85a],  Figure  1-1,  built  by  LCOR 
Bart  Kverett,  Director  Office  of  Robotics  and  Autonomous  Systems  for  the  Naval  Sea 
Systems  Command,  Washington.  D.C.  Robart  was  designed  as  a  sentry  robot  and  was 
loaned  to  the  Naval  Surface  Weapons  Center,  Silver  Spring,  MD  as  a  mobile  platform  for 
research  and  evaluation  of  sensors  and  navigation  algorithms. 

rhe  sonar  sensor  was  a  Polaroid  ultrasonic  transducer  [Polaroid  84]  and  the  infrared 
detector  was  designed  and  built  by  LCDR  Everett.  The  infrared  sensor  had  four  levels  of 
power  output  and  four  stages  of  detector  sensitivity.  A  description  of  the  sensors  and  an 
analysis  of  their  limitations  is  given  in  Chapter  Two.  Chapter  Three  displays  examples  of 
these  limitations  in  plots  of  actual  experimental  data,  formulates  rules  for  combining  the 
<Jdta,  and  shows  how  a  better  map  can  be  extracted  than  if  either  sensor  were  used  alone.  In 
Cnaptcr  Four,  this  modified  map  is  converted  into  another  representation,  the  curvature 
primal  sketch.  Significant  changes  in  curvature  are  used  as  landmarks  for  tracking  between 
robo'.  noves.  A  model  of  the  room  is  then  built  up  from  a  succession  of  these  scans. 
Chap'  7  be  illustrates  how  this  representation  is  used  as  input  to  a  planner  and  examples 
of  two  planners  which  use  the  senory  data  as  input  are  given. 
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Related  Work  on  Intelligent  Mobile  Robots 


Starting  in  the  late  sixties,  a  few  large  projects  were  begun  in  the  United  States, 
Prance.  Fngland  and  Japan  to  develop  autonomous  robots.  However,  funding  was 
gradually  reduced  as  these  projects  failed  to  produce  all  that  they  had  proposed. 
Researchers  th  n  began  to  focus  on  many  of  the  subproblems  that  the  early  work  had 
exposed  to  be  much  harder  than  expected:  problems  such  as  vision,  natural  language  and 
planning  in  uncertain  environments.  Since  then,  there  has  not  only  been  tremendous 
progress  in  image  understanding,  natural  language  and  planning,  but  major  breakthroughs 
in  microelectronics  to  the  extent  that  we  now  have  massively  parallel  computers  with  which 
to  tackle  these  problems  in  real  time.  Consequently,  research  is  again  picking  up  on  the  task 
of  integrating  all  these  modules  to  produce  truly  intelligent  autonomous  vehicles. 

I'he  survey  which  follows  outlines  the  work  which  has  been  done  in  the  past  on 
mobile  robots  and  summarizes  some  of  the  projects  being  pursued  now.  Special  emphasis  is 
placed  on  now  these  endeavors  have  tackled  or  solved  the  problem  of  building  a  memory 
map  and  using  such  a  mode!  for  the  purposes  of  navigation. 

2.1  Shakey  1967-1969 

Some  of  the  earliest  and  yet  at  the  same  time  most  sophisticated  work  in  applying 
artificial  intelligence  to  robots  was  done  at  the  Stanford  Research  Institute  in  the  late  sixties 
on  an  automaton  named  Shakey  (Nilsson  69b,  Coles  69].  Shakey,  Figure  2*1,  operated  off  a 
large  time  sharing  computer,  an  SDS  940,  by  radio  link  and  had  both  a  FORTRAN 
executive  for  control  and  I/O,  and  a  LISP  executive  for  maintaining  its  world  model. 

Its  main  sensor  was  a  rotatable  camera,  and  with  this  sense  of  vision  and  its  many 
levels  of  software,  it  was  able  to  navigate,  explore  and  learn.  This  was  some  of  the  earliest 


Figure  2-1  :Shakcy  -  Moravec  81b 


work  in  machine  vision,  and  one  lesson  learned  was  that  vision  was  a  very  hard  problem, 
Shakcy  also  had  natural  language  capability.  A  person  could  type  in  an  English  sentence 
command,  and  Shakcy  would  parse  the  sentence  and  call  up  the  appropriate  FORTRAN  or 
MSI*  programs  to  carry  out  the  command. 

Shakcy 's  view  of  the  world  came  from  two  models:  a  grid  model  and  a  property  list 
model.  I  he  grid  model  divided  the  room  up  into  nested  4x4  arrays  called  cells,  where  each 
element  of  the  array  was  called  a  square.  ITie  entire  world  consisted  of  one  cell,  in  which 
each  square  could  be  marked  as  full,  partly  Tu 1 1  or  empty.  Partly  full  squares  could  then  be 
represented  as  cells  and  further  subdivided  into  4x4  arrays  of  squares.  Thus  the  room  could 
be  resolved  to  any  desired  level  of  detail,  while  its  representation  would  require  only  a 
minimal  amount  of  computer  memory.  This  idea  later  evolved  into  the  quadtree 
representation  which  is  still  often  held  to  be  a  desiicd  representation  for  w  ork  in  recognition 
ami  planning.  From  the  model,  obstacle-avoiding  trajectories  could  be  calculated  as  shown 
in  Figure  2-2,  It  was  more  difficult  however,  to  plan  journeys  by  using  the  grid  model  than 
by  using  a  fully  divided  large  array  (Rosen  0>S],  Additional  information  had  to  be 
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Figure  2-2:Shakcy's  Grid  Model  -  Nilsson  69a 


maintained  to  help  programs  using  the  grid  model,  such  as  depth  of  the  cell  in  the  model, 
coordinates  of  the  cell,  lengths  of  the  sides,  and  pointers  to  parent  squares  or  cells. 

Vision  was  used  as  an  input  to  the  grid  model.  The  camera  would  take  a  picture, 
convert  it  to  a  line  drawing,  determine  floor  boundaries  of  objects,  and  calculate  free  floor 
space.  It  would  then  add  full  and  empty  areas  into  the  grid  model. 

One  problem  was  that  the  robot's  position  was  "dead  reckoned"  by  keeping  track  of 
wheel  rotations,  and  errors  due  to  slippage  caused  Shakcy  to  miscalculate  its  position.  This 
forced  the  vision  system  to  incorporate  objects  incorrectly  into  the  grid  model.  Because  of 
this,  it  was  noted  that  effective  reorientation  techniques  would  be  an  important  area  for 
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future  study. 


Although  the  grid  model  was  usable  foi  journey  planning  when  the  robot  was  only 
concerned  about  free  or  empty  arc;is,  the  grid  model  was  not  suitable  for  other  functions 
such  as  object  identification. 


A  SQUARE 
FLOOR  AREA 


THE  GRID- 
MOOEL 
VERSION  OF 
THE  SQUARE 
AREA 


Figure  2-3:The  Grid  Model  Cannot  Clearly  Represent 
the  Obstacle  as  a  Square  -  Rosen  68 


As  seen  in  Figure  2-3.  the  jagged  edges  in  the  grid  model's  representation  of  the  square 
made  it  hard  for  the  robot  to  recognize  it  as  such.  To  solve  this  problem,  a  line  model  was 
proposed  in  which  visual  images  would  be  processed  into  line  drawings  and  a  straight-line 
representation  of  obstacles  would  be  used  for  a  model.  This  was  not  successful,  however, 
due  to  the  inability  of  vision  systems  at  that  time  to  provide  the  accuracy  needed. 


In  addition  to  the  grid  and  line  models,  a  property  list  model  was  utilized.  The 
properly  list  model,  later  becoming  the  n-tuplc  model,  represented  objects  in  terms  of  their 
properties,  using  LISP  type  data  constructs.  Thus  an  object  somewhere  in  the  room  might 
be  denoted  its  an  ordered  list  of  such  features  as  x-ccordinatc.  y-coordinate,  angle,  size, 
shape,  etc.  The  property  list  model  was  used  for  interpreting  commands  such  as  "GO  TO  A 
BOX".  The  coordinates  would  be  looked  up  under  an  object  named  "BOX",  then  the  grid 
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model  would  he  accessed  by  I  OR  I  KAN  routines  to  determine  collision -free  paths  and  to 
carry  out  the  task. 

The  integration  of  the  hierarchical  levels  of  software  gave  Shakey  the  sophistication  to 
remain  the  state  of  the  art  robot  •'or  many  years.  What  is  odd.  is  that  Shakey,  at  the  time, 
was  considered  a  failure  or  at  least  an  example  of  something  the  Al  community  had 
promised  but  couldn't  deliver  -  namely,  a  completely  autonomous  robot  Shakey's 
environment  had  to  be  very  simple  for  all  his  systems  to  work,  and  he  was  very  slow,  and 
well,  "shakey".  Funding  on  mobile  robot  research  diminished  and  sponsors  became 
disenchanted  with  Al  in  general  for  various  reasons  [Dreyfus  79],  This  was  mainly  due  to  a 
change  of  plans  at  the  Defense  Advanced  Research  Projects  Agency  and  not  for  scientific 
reasons. 

The  main  lesson  learned  was  that  the  instinctive  skills  which  are  easy  for  humans,  such 
as  seeing,  moving,  etc.,  are  very  hard  to  program  into  a  robot,  whereas  higher  level  functions 
that  are  hard  for  humans,  such  as  calculating,  are  much  easier  for  a  robot 

One  of  the  contributors  to  the  Shakey  project  was  once  asked  if  all  the  work  that  went 
into  Shakey  could  have  been  done  in  software  as  a  simulation.  His  answer  was  negative, 
because  they  wouldn't  have  known  what  to  simulate.  The  difficulty  lay  in  designing 
algorithms  for  poor  data,  not  for  perfect  data,  and  they  would  not  have  known  in  which 
ways  the  data  would  have  been  poor  [Raphael  68). 

After  Shakey,  funding  was  continued  in  the  areas  of  vision,  natural  language 
processing  and  planning  as  serious  problems  in  and  of  themselves,  and  not  necessarily  as 
subproblerrs  of  a  mobile  robot  system. 

2.2  The  Jet  Propulsion  Laboratory  Mars  Rover  1970-1973 

in  the  early  seventies  NASA  began  a  project  to  develop  a  rover  to  be  used  in  planetary 
exploration  [Lewis  73.  Dobrotin  77.  Miller  77,  l  ewis  77,  Thompson  79],  It  had  been  noted 
in  previous  Viking  missions  that  due  to  long  telecommunication  delays  it  had  taken  several 
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Figure  2-4:The  JPL  Mrs  Rover  •  Moravec  81b 

The  JPL  robot.  Figure  2-4,  consisted  of  a  mobile  vehicle  equipped  with  a  six-degree- 
of- freedom  manipulator  (a  modified  Stanford  arm)  and  an  assortment  of  sensors  (laser 
range-finder,  stereo  TV  cameras,  tactile  sensors  and  proximity  sensors).  The  navigation 
system  used  a  gyrocompass  and  optical  encoders  on  the  wheels  for  dead  reckoning.  An 
on-board  mini-computer  (General  Automation  SPC-16  with  32K  memory)  for  real-time 
control  of  motors  communicated  with  a  remote  POP- 10  on  the  Arpanet.  The  remote  system 
was  used  to  process  TV  and  laser  pictures,  to  construct  the  "world  model"  and  to  do 
planning  and  decision  making.  The  robot,  however,  never  advanced  beyond  the  stage  of 
being  tethered  with  a  50-100  foot  cable. 

The  rover's  objectives  were  to  analyze  a  scene  for  traversability.  plan  a  path  to  the  goal 


and  follow  that  path  without  humping  into  anything.  These  objectives  were  achieved  only 
in  a  simplified  environment  consisting  of  a  laboratory  with  a  flat  surface,  a  limited  number 
of  obstacles  and  constant  illumination. 
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Figure  2-5: The  JPL  Rover’s  Map  -  Thompson  79 

The  mode!  of  the  world  held  by  the  JPL  Rover  was  a  segmented  terrain  model 
derived  by  inputs  from  the  vision  system.  Since  the  area  explored  by  the  robot  was  large, 
the  terrain  model  was  partitioned  into  map  sectors  of  a  convenient  size  and  stored  as 
separate  files.  I  nch  sector  was  a  fixed  lattice  of  grid  lines  drawn  parallel  to  the  Rover’s 
absolute  coordinate  system.  T  he  resultant  collection  of  map  sectors  was  similar  to  a  catalog 
of  charts.  Fach  map  sector  represented  areas  that  were  either  not  traversable  or  unknown. 
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as  shown  in  Figure  2-5.  All  other  areas  were  assumed  traversable.  Non-traversablc  regions 
were  described  as  boundaries  of  polygons  and  these  regions  were  then  represented  as  lists  of 
the  veniccs  of  those  polygons. 

This  map  had  to  be  continually  updated  while  the  robot  moved  around  performing  its 
assigned  task,  and  errors  frequently  got  incorporated  into  the  modei.  The  first  source  of 
error  was  the  uncertainty  in  vehicle  position  due  to  dead  reckoning.  This  error  increased 
with  the  distance  from  a  known  location.  The  second  source  of  error  was  the  limitation  of 
the  vision  system  to  accurately  determine  relative  positions  of  obstacles.  Once  an  internal 
model  was  built,  the  Rover  could  refer  to  that  model  and  using  various  search  algorithms, 
plan  an  optimum  route  o  its  goal. 

Although  the  JPI.  Rover  project  was  able  to  produce  several  useful  robotic  subsystems 
such  as  the  manipulator,  the  laser  rangefinder  and  the  navigation  system,  putting  them 
together  did  not  result  in  a  completely  autonomous  robot  as  desired.  The  tether  still 
remained  and  improvements  were  still  needed  to  reduce  errors  in  the  respective  subsystems 
so  that  the  final  system  would  be  able  to  act  intelligently  and  with  a  higher  level  of 
coordination.  It  was  the  classic  case  of  an  attempt  at  system  building  before  the  technology 
for  the  components  was  available. 

2.3  The  Stanford  Cart  1973-1981  and  CMU  Robots  1981  - 

From  1973  to  1981,  work  was  done  at  the  Stanford  University  Artificial  Intelligence 
Lab  by  Hans  Moravec  on  developing  a  remotely  controlled  TV  equipped  mobile 
robot  [Moravec  31b,  Moravec  83],  A  crude  cart  was  used  as  the  mobile  platform,  but  a 
sophisticated  vision  system  and  appropriate  navigation  and  obstacle  avoidance  software 
enabled  the  Cart  to  move  through  cluttered  spaces. 

The  Cart  with  its  camera  system  is  shewn  in  Figure  2-6.  The  Cart  used  stereo  imaging 
to  locate  objects  and  to  deduce  its  own  motion.  A  TV  link  connected  the  Cart  to  a  remote 
KL-10,  which  sent  control  commands  to  the  Cart  and  also  did  all  image  processing.  The 
camera  on  top  of  the  Cart  was  mounted  on  rails  and  slid  by  remote  control  to  nine  different 


figure  2-6:1  he  Stanford  Cart  -  Moravcc  81b 


positions  to  pet  nine  pictures  of  the  view  before  it.  These  pictures  were  then  digi’ized  and 
processed  to  extract  31)  information  from  die  scene. 

Processing  of  the  pictures  amounted  to  extracting  features  front  each  picture  and  then 
v  -ov  luting  those  feature  points  between  any  two  images,  features  were  extracted  by 
running  an  '"interest  operator"  over  each  digitized  picture,  which  would  pick  out  areas  in 
the  piastre  which  hud  the  maximum  gradient  of  grey  scale.  Thus  points  such  as  the  comer 
of  a  table  would  be  picked  out  because  the  top  of  the  table  might  be  well' lit  while  the  side 
was  dimmed  by  shadow  feature  points  would  be  marked  in  as  many  of  the  nine  pictures  as 
possible  and  then  a  correlator  routine  would  compare  that  feature  point's  change  in  pixel 
position  between  any  two  pictures.  Knowing  that  information  and  the  distance  that  the 
camera  had  moved  gave  distance  to  the  object.  Nine  pictures  were  used  to  increase 


Figure  2-7:'ITie  Cart's  View  of  the  World  -  Moravec  81b 
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The  digitized  mage  with  its  feature  points  marked  is  shown  in  Figure  2-7.  Also 
shown  is  the  path  which  the  Cart  had  planned  to  reach  its  goal.  This  information  was  used 
to  build  a  model,  and  front  this  model  it  would  plan  an  obstacle  avoiding  path  to  its 
destination.  The  system  worked  but  w.ts  slow  due  to  many  factors.  'These  ranged  from  the 
many  computations  needed  to  deduce  the  cart's  own  motion  since  its  own  dead  reckoning 
system  was  so  weak,  to  the  fact  that  the  system  utilized  interpreted  LISP  tunning  on  pre-LSI 
technology.  The  Cart  would  move  one  meter,  stop,  take  pictures,  think  for  fifteen  minutes, 
and  then  move  forward  another  meter.  The  Cart  successfully  maneuvered  through  several 
20  meter  courses  (each  taking  about  five  hours)  but  failed  in  other  runs. 

Some  problems  in  these  runs  were  that  featureless  objects  were  hard  to  see,  and  also 
that  shadows  often  moved  considerably  during  the  course  of  the  run,  throwing  off  the 
correlator  since  shadows  produced  new  feature  points  due  to  their  high  contrast  Another 
problem  involved  weaknesses  with  the  vision  system's  ability  to  maintain  an  accurate  self¬ 
position  model.  Although  the  model  was  to  be  updated  after  each  lurch,  small  errors  in  the 
measured  feature  positions  sometimes  caused  the  solver  to  converge  to  a  position  with  an 
error  beyond  the  expected  uncertainty.  Any  features  incorporated  into  the  model  after  the 
Cart  lost  its  correct  sense  of  self-position  were  inserted  wiongly.  These  errors  were 
cumulative  and  caused  the  same  object  to  seem  to  be  in  another  place.  The  combination  of 
old  and  new  positions  of  these  objects  made  it  appear  to  the  Cart  that  the  path  was  blocked 
when  in  actuality  it  was  open. 

Much  of  this  research  has  continued  at  Carnegie  Mellon  University  in  a  number  of 
systems  which  they  have  built  there.  Figure  2-8  shows  one  type  of  world  mode!  they  use  to 
represent  sensory  uata  [Moravec  85j.  Twenty  four  Polaroid  sonar  transducers  are  mounted, 
in  a  ring  around  the  robot  and  the  data  from  separate  sensors  arc  combined  in  a  probability 
map  which  represents  areas  that  are  cither  empty,  occupied  or  unknown.  Each  cell  in  the 
grid  represents  six  square  inches  of  floor  space,  and  the  value  in  each  cell  can  range  from  -1 
to  1.  Negative  numbers  represent  a  probability  that  the  area  is  empty,  while  positive 
numbers  mean  it's  probably  occupied.  Zero  represents  the  unknown.  The  basic  idea  is  that 
because  of  the  wide  bcamwidth  of  the  sonar,  there  may  or  may  not  be  an  object  in  the  line 
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of  sight;  some  object  might  be  picked  up  by  the  edges  of  the  son.ir  beam.  In  addition  to  a 
probabilistic  measure  in  terms  of  angle,  range  can  also  be  modeled.  For  any  sonar  range 
reading  there  might  be  some  small  error,  but  most  of  the  area  up  to  that  distance  can 
confidently  be  marked  as  empty.  By  combining  information  from  many  readings  its  the 
robot  moves  through  the  room,  areas  known  to  be  empty  or  occupied  are  expanded,  and  the 
uncertainties  associated  with  each  region  are  decreased.  The  effect  is  that  object  locations 
become  known  with  increasing  precision. 


Figure  2-8:Empty,  Unknown  and  Occupied  Areas  in  a  Sonar  Map  -  Moravec  85 

2.4  Ililare  1977- 

Work  began  in  1977  in  France  at  the  Laboratoire  d'Automatique  et  d’Analyse  des 
Systemes  to  develop  an  autonomous  robot  that  utilized  multiple  sensors  and  would  be 
equipped  with  a  multi-level  computer  and  decision  system  [Briot  81,  Bauzil  81,  Ferrer  81]. 

Flilare,  Figure  2-9,  has  a  3D  vision  system  which  uses  a  laser  range-finder  in 
conjunction  with  a  video  camera.  Its  sensor  system  also  incorporates  ultrasonic  devices  as 
proximity  detectors  for  close-in  obstacle  detection  and  for  paralleling  a  wall.  It  uses  a 
system  of  infrared  beacons  mounted  on  the  walls  in  the  corners  of  its  room  to  give  it 
absolute  positioning  information.  This  works  by  using  two  infrared  emitters  and  detectors 
on  the  robot.  Measurements  of  angles  are  made  by  counting  control  pulses.  The  multi-level 
computer  system  consists  of  three  8085  on-board  microprocessors  for  sensory  data 


Figure  2-9:Hilare  -  Ferrer  81 


processing,  an  off-board  MITRA-15  minicomputer  for  navigation  and  communication 
tasks,  and  a  remote  IBM-370  used  as  a  peripheral  to  the  minicomputer  for  complex  tasks. 

A  distributed  decision-making  capability  is  provided  through  a  system  of  cooperating 
expert  modules  which  have  expertise  in  the  areas  of  object  identification,  navigation, 
exploration  and  planning.  These  modules  consist  of  specialized  knowledge  bases, 
algorithms  and  heuristics,  error  processing  capabilities,  and  communication  procedures. 
This  system  enables  liilarc  to  carry  out  navigation  tasks  which  involve  universe  modeling, 
building  a  plan,  and  supervising  the  development  and  execution  of  dial  plan  [Giralt 
77.  Laumond  83],  Hiiarc's  world  model  defines  obstacles  as  polyhedrons  whose  projections 
on  the  floor  determine  the  navigation  problem.  This  model  can  either  be  determined  by  the 
robot's  perception  system  or  provided  as  initial  information. 


Figure  2-10:Hilare‘s  World  Model  -  Girait  77 


The  obstacles  are  represented  as  an  ordered  list  of  segments  where  each  segment  is 
represented  by  the  Cartesian  coordinates  of  its  leftmost  point,  an  angle  with  respect  to  some 
reference  axis,  and  its  length.  As  seen  in  Figure  2-10,  empty  areas  are  partitioned  and 
represented  as  convex  polygonal  cells  which  include  obstacle  segments.  Trajectories  within 
cells  are  straight  line  paths  between  entry  and  exit  segments  so  that  adjacent  cells  have 
common  segments  which  are  traversable  by  the  robot  This  pattern  of  connectivity  can  then 
be  represented  as  a  graph,  which  provides  the  structure  necessary  for  path  finding. 

Optimum  paths  are  determined  by  making  a  search  over  the  resulting  graph  while 
minimizing  costs  in  terms  of  distance  and  energy  requirements.  The  minimization  function 
is  a  linearly  weighted  combination  of  path  length,  angle  of  planned  direction  change,  and 
the  number  of  predicted  slops,  together  with  a  term  which  accounts  for  the  uncertainty  of 
information  obtained  by  the  robot  and  also  the  path  viability  due  to  estimated  obstacle 
clusterings. 

The  model  is  built  up  by  merging  information  from  laser  rangefinder  scans  as  the. 
robot  moves  from  one  position  to  another  [Chatila  85].  Perceived  obstacles  are  assigned 
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their  own  coordinate  frames.  Before  the  robot  moves,  it  predicts  what  it  might  see.  That  is. 
it  hypothesizes  which  edges  might  become  occluded  and  which  might  become  uncovered. 
Then  after  moving  to  its  new  location,  it  matches  the  predicted  model  to  what  it  actually 
perceives.  If  some  adjustment  must  be  made  to  bring  the  two  into  alignment,  that  that  can 
be  used  to  update  the  robot's  position.  Figure  2-11  shows  four  figures  (from  left,  to  right 
respectively)  that  depict  the  process  of  perceiving,  predicting,  perceiving  and  merging  the 
models. 
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Figure  2-1  l:Mcrging  Perceived  and  Predicted  Models  -  Charila  85 


2.5  Robart  I  1980-1982 

Robarl  I  was  built  at  the  Naval  Postgraduate  School  by  LCDR  Bart  Everett,  to  serve 
as  a.  feasibility  demonstration  for  an  autonomous  robot  [Everett  82b.  Everett  82a],  Robart, 
Figure  2-12.  would  randomly  patrol  a  house  sensing  for  fire,  smoke,  tlooding,  toxic  gas, 
intrusion,  etc.,  and  take  appropriate  warning  action  if  any  of  these  conditions  were  found. 
The  goal  of  the  project  was  to  show  that  certain  applications  could  indeed  be  handled  by 
..  'tonomous  mobile  robots,  using  current  technology,  under  the  right  conditions.  The 
particular  application  of  a  sentry  was  chosen  because  it  did  not  require  any  end  effectors  or 
a  vision  system,  iuv.  project  was  done  on  an  extremely  limited  budget,  using  simplified 


approaches:  the  philosophy  being  that  if  successful  under  those  conditions,  tin  extrapolation 
should  show  die  tremendous  potential  if  later  addressed  w  ith  sufficient  funding. 

The  robot  had  a  single  forward  looking  ultrasonic  ranging  unit,  a  long  range  near- 
infrared  proximity  detector  that  could  be  positioned  by  a  rotating  head,  ten  short  range 
near-infrared  proximity  detectors,  and  tactile  feelers  and  bumper  switches  for  collision 
avoidance.  The  battery  voltage  was  constantly  monitored  and  when  it  fell  below  a  certain 
adjustable  threshold,  the  robot  would  activate,  via  a  radio  link,  a  homing  beacon  placed  on 
lop  of  its  recharging  station.  For  simplicity,  an  ordinary  75  watt  light  bulb  was  used  as  the 
beacon,  tracked  by  an  optical  photocell  array  located  on  the  robot's  head.  Thus  the  head 
position  represented  the  relative  bearing  to  the  beacon,  and  the  robot  could  home  in  on  the 
battery  recharger.  The  software  provided  verification  of  the  correct  beacon  acquisition,  the 
ability  to  maneuver  around  obstructions  enroute,  and  the  correction  of  any  misalignment 
that  occurred  as  a  result  of  collision  avoidance. 

Other  sensors  onboard  included  a  true-infrared  body  heat  sensor  which  could  detect  a 
person  out  to  a  distance  of  fifty  feet  This  sensor  was  fairly  directional,  and  mounted  on  the 
head  so  as  to  be  positionable  under  software  control.  Also  mounted  on  the  head  was  a 
near-infrared  long  range  proximity  sensor  with  a  parabolic  reflecting  collector,  able  to  detect 
the  edges  of  an  open  doorway  to  within  an  inch  at  a  distance  of  six  feet.  This  angular 
resolution  allowed  the  robot  to  steer  toward  the  center  of  the  doorway  while  still  some 
distance  away. 

In  addition  to  its  multitude  of  sensors,  the  five-foot-tall  Robart  could  also  speak. 
Voice  synthesis  was  not  only  used  to  warn  of  the  presence  of  intruders  or  other  alarm 
conditions,  but  could  also  report  on  the  internal  status  of  its  circuits,  system  configuration 
errors,  time-of-day,  temperature,  etc. 

Robart's  behavior  appeared  arbitrary,  or  at  least  not  preprogrammed.  An  operating 
system  provided  for  the  selection  of  various  behavior  primitives,  each  designed  to  meet  a 
specific  goal,  based  on  the  output  of  specific  sensors,  via  interrupt  software.  When  no 
specific  actions  were  called  for,  a  routine  wasrandomly  chosen  from  a  preprogrammed  set 
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of  sixteen  routines  that  filled  in  the  gaps.  Some  of  these  routines  would  move  the  robot 
more  or  less  randomly  to  a  new  vantage  point,  where  it  might  elect  to  stop  and  re-enter  the 
surveillance  mode.  Motion  under  these  circumstances  usually  involved  moving  straight 
ahead,  unless  it  saw  an  object,  in  which  case  it  would  swerve  to  one  side  or  the  other  as 
appropriate.  It  would  then  continue  moving  in  the  new  direction  until  it  encountered 
another  obstacle. 

Robart  could  also  be  put  in  either  the  "Hostile"  or  "Friendly”  mode.  In  the 
"Friendly"  mode  it  would  greet  a  person  with  an  amiable  "Hi"  or  "Hello”,  while  in  the 
"Hostile"  mode  it  would  announce  "Intruder,  Intruder",  and  then  advise  the  intruder  to 
leave  the  room. 

All  sensors  were  interfaced  to  one  6502  based  SYM-1  computer  on  an  interrupt  basis. 
A  triangular  wheelbase  was  utilized,  with  the  one  front  wheel  providing  power  and  steering. 
Optical  encoders  were  not  used  so  dead  reckoning  was  not  performed,  but  an  A/D 
converter  gave  four  bits  of  information  on  steering  wheel  angle.  The  rotating  head  had 
similar  resolution.  In  the  worst  case,  wheel  and  head  together  could  have  as  much  as  22 
degrees  of  error  when  looking  for  the  recharging  station.  This  was  done  on  purpose, 
however,  to  demonstrate  the  feasibility  of  software  compensation.  In  over  200  dockings, 
Robart  only  failed  once  to  hit  its  recharging  station  within  half  an  inch  from  the  centerline 
of  its  front  bumper.  The  entire  robot  was  powered  by  one  12V  20  amp-hour  battery, 
providing  roughly  ten  hours  of  service,  with  fourteen  hours  needed  for  full  recharge. 

2.6  Robart  II  1982  • 

Robart  II  [Everett  85a],  the  robot  used  in  this  thesis,  is  an  improved  version  of  its 
predecessor,  Robart  1.  Robart  II  is  a  battery  operated  autonomous  mobile  robot  which 
stands  four  feet  tall,  and  measures  17  inches  across  at  the  base.  The  number  of  on-board 
65C02  (CMOS)  based  computers  has  been  increased  from  one  to  eight,  allowing  for  more 
sensors  to  be  interfaced  and  for  more  processes  to  run  in  parallel. 

The  platform  houses  a  variety  of  sensors  for  path  planning,  collision  avoidance,  and 
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environmental  awareness.  These  include  six  ultrasonic  rangefinders,  fifty  near-infrared 
proximity  detectors,  a  long  range  near-infrared  rangefinder,  plus  various  sensors  used  to 
detect  special  alarm  conditions,  such  as  fire,  smoke,  toxic  gas,  flooding,  vibration  and 
intrusion.  F  our  true  infrared  motion  detectors  are  employed  for  detecting  the  presence  of 
an  intruder  up  to  seventy-five  feet  away,  reacting  to  the  thermal  radiation  emitted  by  the 
human  body.  Special  internal  circuitry  checkpoints  are  analyzed  by  self-diagnostic  software 
and.  when  necessary,  operator  assistance  is  requested  through  speech  synthesis. 

A  front  view  of  Robart  I!  (Figure  2-13)  shows  the  five  sonar  transducers  on  the  body 
and  one  on  the  rotatable  head.  The  long-range  near-infrared  sensor  with  parabolic 
reflecting  dish  sits  on  top  of  the  head  and  three  of  the  true  infrared  motion  detectors  are 
mounted  just  below  the  head.  A  rear  view  (Figure  2- 14)  exposes  the  card  cage  which  houses 
the  eight  computers  and  all  the  interface  circuits. 

Two  twelve-volt  DC  motors  powering  eight-inch  diameter  wheels  are  mounted  on 
either  side  of  the  base  platform  so  as  to  be  symmetrical  with  respect  to  the  vertical  axis  of 
the  robot,  and  are  independently  controlled  to  provide  differential  steering.  Two  non- 
driveable  casters  are  included  for  stability. 

The  computer  architecture  onboard  consists  of  a  SYM-1  6502  board  acting  as  a 
Scheduler  for  the  five  dediealed  MMC-02  65C02  controllers.  These  dedicated  controllers 
directly  interlace  with  the  head,  the  drive  motors,  the  sonars  and  the  speech  synthesis  and 
recognition  processors.  One  controller  also  controls  two  linear  CCD-array  cameras  which 
are  still  to  be  installed.  TTic  SYM-1  manages  all  communications  between  processors  and 
holds  global  information  which  several  controllers  may  need.  All  low  level  controllers 
receive  commands  from  the  SYM  1  via  an  eight  line  parallel  bus,  and  communicate 
information  back  up  via  a  common  serial  interface. 

A  second  low  level  dedicated  65C02  controller  is  used  to  operate  six  ultrasonic  ranging 
modules  through  a  special  multiplexing  circuit  (Everett  85b).  Five  of  these  units  have  their 
transducers  arranged  in  a  forward-looking  array,  with  overlapping  beam  patterns.  These 
transducers  can  be  sequentially  fired  in  any  combination,  as  determined  by  the  command 
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Figure  2- 1 4:  Rear  View  -  Controlling  Computers  -  Five  re  tt  85a 


from  the  SYM  1.  Associated  ranges  are  then  feci  back  up  to  the  Scheduler.  A  sixth 
transducer  is  mounted  on  the  rotatable  head,  positionablc  up  to  100  degrees  either  side  of 
centerline  I  he  position  and  velocity  of  the  head  are  controlled  by  another  dedicated  low 
level  microprocessor.  The  head  controller  also  interfaces  to  the  A/D  converter  which 
monitors  the  environmental  sensors  and  the  proximity  detectors. 

A  fourth  dedicated  controller  is  assigned  the  function  of  controlling  a  D I’- 1050 
micropnvessiu  based  speech  synthcsi/er,  and  a  speech  recognition  board  which  can 
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Robart  II  Architecture 


Figure  2-15:Architecture  of  the  Controllers 


recognize  up  to  256  speaker-trained  words.  The  firth  low  level  processor  controls  the  drive 
wheels  through  pulse  width  modulation.  The  two  wheels  are  synchronized  by  encoders  on 
the  drive  shafts.  The  encoders  also  supply  distance  and  velocity  information  for  dead 
reckoning. 

An  off-board  IBM-XT  was  used  to  gather  the  sonar  and  infrared  data  presented  in  this 
thesis.  The  SYM-1  Scheduler  directed  the  head  and  sonar  controllers  to  file  the  infrared 
and  sonar  sensors  once  each  as  the  head  moved  through  256  positions.  All  readings  were 
returned  to  the  SYM-1  and  shipped  out  to  the  XT,  which  was  mnning  a  C  program  to 
capture  the  data.  Forty-six  sets  of  scans  were  taken  as  the  ubot  was  positioned  throughout 
the  room,  and  all  this  data  was  transported  to  a  LISP  Machine  at  MIT  for  further 
processing. 


Much  of  the  structural  hardware  on  Rohart  II  was  already  assembled  when  I  began 
my  axvpcrative  education  assignments  at  Naval  Surface  Weapons  Center.  One  computer 
had  been  installed  and  a  few  sensors  had  been  mounted  and  interfaced.  Nevertheless, 
I.CDR  H\crett  and  I  spent  uncountable  hours  and  many  weekends  over  the  next  year 
hacking  circuits,  software,  vendors  and  packaging,  to  bring  eight  computers  on  line,  make 
them  all  talk  to  each  other,  control  the  drive  wheels  and  head,  interface  with  the  sensors, 
and  make  Robart  talk  and  follow  us  around.  In  the  end,  we  had  a  very  beautiful  and 
elegant  robot. 
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Chapter  Three 
Modeling  the  Sensors 


The  survey  of  the  previous  chapter  shows  clearly  that  much  work  remains  to  be  done 
in  integrating  sensory  data  into  robot  systems  that  can  produce  intelligent  action.  In  order 
for  a  robot  to  analyze  the  incoming  data  effectively,  it  must  have  a  good  model  of  the 
strengths  and  weaknesses  of  its  sensors. 

The  Sonar  Rangefinder 

A  careful  look  at  the  specifications  for  the  Polaroid  transducer  [Polaroid  84]  along 
with  a  few  simple  measurements  initiate  the  following  as  a  reasonable  model.  Figure 
3-1  shows  die  radiation  pattern  for  the  transducer. 


Figure  3*1: Radiation  Pattern  of  the  Sonar  Transducer  -  Polaroid  84 


The  bcamwidth.  typically  measured  at  the  3dB  point,  is  shown  to  be  roughly  10 
degrees.  However,  in  actual  practice,  the  transducer  is  sensitive  enough  to  detect  echoes  of 
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energy  transmitted  from  the  sidclobes.  In  testing,  the  rangefinder  could  detect  a  one  inch 
diameter  pole  up  to  an  angle  of  40  degrees.  With  such  a  large  beamwidth.  a  robot  scanning 
a  room  with  this  sensor  perceives  all  objects  to  be  much  wider  than  they  really  are. 

The  rangefinder  is  capable  of  measuring  distances  to  an  object  with  a  resolution  of 
0.12  inch  through  a  range  of  0.9  to  35.0  feet.  This  is  accomplished  by  measuring  the  time  of 
(light  between  a  transmitted  pulse  and  a  returned  echo  and  multiplying  by  the  speed  of 
sound.  The  distance  measure,  however,  is  not  necessarily  the  distance  in  the  direction  the 
sensor  is  pointing,  since  the  width  of  the  beam  may  cause  an  echo  from  one  edge  to  be 
returned  before  the  echo  from  the  centerline.  Figure  3-2  illustrates  that  although  the  sensor 
is  pointing  in  the  di'  'ction  along  AB,  the  measured  distance  returned  is  actually  AC. 


Figure  3-2:Scnsor  Measures  Shortest  Distance  to  a  Wall 


Another  measurement  error  is  due  to  specular  reflections  on  smooth  surfaces.  Due  to 
the  large  wavelength  of  sound,  about  1/8  of  an  inch,  many  surfaces  appear  smooth.  A  sonar 
beam  incident  on  such  a  surface  does  not  reflect  an  echo  directly  to  the  sensor.  Instead,  it 
bounces  off  at  an  angle  equal  to  the  angle  of  incidence,  and  possibly  bounces  off  other 
objects  before  being  detected.  Hence,  the  transducer  measures  a  much  longer  distance  than 
it  should. 

Figure  3-3  illustrates  this  problem.  In  actual  tests  against  a  smooth  suriace  such  as 
sheet  rock,  specular  reflections  occur  when  the  sensor  was  aimed  at  an  angle  less  than  25 
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Figure  3-3:Specular  Reflections  Off  a  Smooth  Surface 

degrees  from  the  surface.  Against  a  rougher  surface  such  as  cinder  block,  there  were  no 
specular  reflections  at  all. 

Other  errors  come  about  due  to  atmospheric  effects,  such  as  the  change  in  the  speed 
of  sound  caused  by  temp-  erature  and  humidity  changes.  The  speed  of  sound  is  a  function 
of  temperature  where: 

Speed  of  sound  =  331.4  ~\  /  (Tin  Kelvin) 

V  273  ^ 

Distances  returned  when  assuming  80  degrees  Farenheit,  but  where  actual 
temperature  is  60  degrees,  will  be  seven  inches  too  long  (Everett  85b].  The  effect  of  relative 
humidity  can  be  found  by  table  look-up  [Maslin  83].  Robart  carries  onboard  sensors  for 
temperature  and  humidity,  so  these  errors  can  be  compensated  for  before  the  sonar  data  is 
sent  on  for  processing  with  the  infrared  data. 

Finally,  there  is  another  type  of  measurement  error  due  to  the  form  of  the  sound  pulse 
which  is  transmitted.  The  pulse  is  actually  a  chirp,  1  ms  long,  of  56  pulses  of  4  different 
frequencies.  There  are  8  pulses  at  60  kHz,  8  pulses  at  56  kHz,  16  pulses  at  52.5  kHz  and  24 
pulses  at  49.41  kHz.  The  time  of  fight  measurement  begins  with  the  rising  edge  of  the  first 
pulse  transmitted,  and  ends  with  the  detection  of  the  first  echo.  Figure  3-4  shows  a  timing 
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Figure  3-4:Transmittcd  Pulse  -  Polaroid  84 

diagram.  Four  different  frequencies  arc  transmitted  to  compensate  for  the  fact  that 
different  types  of  surfaces  absorb  energy  of  different  frequencies.  With  four  frequencies,  it 
is  more  likely  that  every  surface  will  return  an  echo.  However,  that  echo  is  not  necessarily 
associated  with  the  first  pulse,  the  one  from  which  the  time  of  flight  is  measured.  In  the 
worst  case,  an  echo  from  the  hist  pulse  adds  1  ms  onto  the  actual  time  of  flight  This  error 
corresponds  to  about  3  inchqs  of  additional  distance  measurement 

Modeling  the  Infrared  Detector 

The  infrared  sensor  emits  a  pulse  of  irifrared  light  and  then  uses  a  parabolic  dish  and 
an  infrared  detector  to  sense,  over  a  very  narrow  area,  any  returned  infrared  energy.  Unlike 
sonar,  the  time  of  flight  of  the  light  pulse  cannot  be  easily  measured.  The  sensor  merely 
gives  an  indication  of  whether  or  not  any  returned  pulse  was  detected.  So,  although 
distance  to  an  object  cannot  be  ascertained,  the  absence  or  presence  of  an  object  can  be 
determined  with  very  good  angular  resolution. 

The  infrared  emitter  used  here  is  actually  slightly  more  complicated  than  described 
above.  Four  LFDs  arc  used  to  enable  the  sensor  to  incrementally  step  up  the  power  output 
so  that  the  range  is  increased.  A  good  analogy  is  that  of  a  blind  man's  telescoping  cane. 
First  one  FED  is  fired.  If  no  return  is  detected,  two  LFDs  arc  then  fired  simultaneously, 
doubling  the  power  output,  and  extending  the  range  of  the  sensor.  If,  again,  there  is  no 
detection,  three  LFDs  arc  fired  in  unison,  and  finally  four  if  necessary.  When  a  returned 
pulse  is  detected,  the  robot  notes  how  many  LEDs  were  fired,  and  th:3  gives  a  very  rough 
indication  of  range.  However,  information  which  is  more  beneficial  is  to  have  the  robot 
scan  and  note  the  point  at  which  there  is  a  discontinuity  from  detection  to  no  detection,  or 
vice  versa.  This  often  means  a  corner  has  been  found,  as  shown  in  Figure  3-5. 
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Figure  3-5: Infrared  Sensor  Finds  a  Comer 


The  infrared  detector  circuitry  is  also  slightly  more  sophisticated  than  described 
above.  Instead  of  a  binary  Detect/No  Detect  signal,  there  are  actually  four  detectors  of 
varying  sensitivity.  After  the  first  single  LED  is  fired,  the  robot  polls  these  four  detector 
outputs,  starting  with  the  least  sensitive  one.  If  any  detector  has  sensed  an  echo,  the  robot 
notes  how  many  LEDs  were  fired  and  which  level  of  detector  sensitivity  was  first  to  pick  up 
the  echo.  If  none  detected  an  return,  two  LEDs  are  fired  together,  all  the  detectors  are 
polled,  and  the  process  is  repeated  up  to  the  firing  of  all  four  LEDs.  Figure  3-6  shows  a 
diagram  of  the  infrared  sensor. 

The  resulting  data  is  input  to  the  computer  in  a  two-digit  format,  such  as  31.  The  first 
digit  signifies  how  many  LEDs  were  fired  and  the  second  digit  tells  which  detector  circuit 
sensed  the  echo.  In  this  example,  three  LEDs  were  fired  and  the  first  detector  sensed  the 
echo. 

The  maximum  range  of  the  sensor,  using  4  LEDs,  is  approximately  18  feet,  although 
only  3  LEDs  were  actually  used  here.  They  gave  a  maximum  range  of  about  10  feet  The 
range  is  a  function  of  how  much  energy  is  reflected  off  the  surface.  Different  types  of 
surfaces  absorb  varying  amounts  of  infrared  energy,  so  this  sensor  gives  only  a  rough 
estimate  of  distance  ranges.  However,  smooth  surfaces  do  not  pose  the  same  problems  of 
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specular  reflection  as  they  do  with  sonars,  because  of  the  much  shorter  wavelength  of 
infrared  light 


Chapter  Four 

Combining  the  Data  From  Both  Sensors 

After  defining  a  mode!  of  how  the  sensors  work  and  taking  experimental  data,  it’s 
possible  to  formulate  a  set  of  rules  for  how  the  information  from  each  sensor  should  be 
combined  to  build  a  model.  These  rules  will  tell  when  data  from  either  sensor  is  valid  and 
which  sensor  to  rely  on  when  they  give  conflicting  information. 


Figure  4-l:Sonar  Plot  of  the  Room 


All  the  data  was  taken  in  a  basement  in  which  there  were  different  configurations  of 
obstacles  and  the  robot  was  situated  in  various  locations.  Figure  4-1  shows  one  sonar  plot  of 
the  room  in  which  the  robot  was  6  feet  in  front  of  a  wall  that  had  an  open  door.  Also,  to  its 
left,  7.5  feci  away,  was  another  open  door.  The  room  was  19'  7.5"  long  and  13'  8.5"  wide 
and  the  walls  were  relatively  rough,  with  cinder  block  along  one  side  and  exposed  studs 
along  the  others.  Consequently,  the  plot  looks  fairly  clean.  However,  it's  clearly  seen  that, 
even  though  the  doors  are  open,  they  appear  dosed  to  the  robot  because  at  that  distance,  the 
sonar  beam  is  wider  than  the  door. 
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The  sonar  transducer  and  infrared  sensor  are  both  mounted  on  the  robots  head, 
which  rotates  100  degrees  both  left  and  right.  Two  hundred  fifty  six  readings  are  taken  on 
each  scan  while  the  robot  is  held  stationary.  The  angle  at  which  each  reading  is  taken,  and 
the  distance  measured  are  converted  to  cartesian  coordinates  and  overlaid  onto  the  actual 
room  map.  All  the  data  is  displayed  in  a  256  x  256  bitmap  array. 


Figure  4-2:Duta  Values  in  the  Boundary 


Filler  points  arc  added  to  create  a  connected  boundary,  since  the  conversion  from 


cylindrical  coordinates  in  which  the  data  is  taken,  to  cartesian  coordinates  in  which  the  data 
are  displayed  creates  a  sparse  boundary.  Every  data  point  in  the  boundary  carries  with  it 
information  such  as  its  index  into  the  boundary,  its  k-y  position,  the  angle  from  the  robot  at 
which  the  reading  was  taken  with  respect  to  a  coordinate  system  attached  to  the  robot,  the 
radial  distance  measured  by  the  sonar  and  the  infrared  reading.  Figure  4-2  shows  the  actual 
data  values  along  one  smafl  section  of  the  boundary. 

The  circled  region  pointed  to  by  the  arrow  corresponds  to  the  partial  array  of  numbers 
to  the  left  The  zeroes  represent  white  space  while  the  other  numbers  represent  either  actual 
data  points,  filler  points  or  room  obstacles.  For  real  data  points,  there  is  a  set  of  three 
numbers:  the  radial  distance  in  feet,  the  infrared  reading,  and  the  angle  from  the  robot  in 
degrees.  For  instance,  the  point  at  (x  y)  =  (231  186)  is  6.2  feet  from  the  robot,  has  the 
infrared  reading  11  associated  with  it,  and  lies  at  an  angle  of  19.8  degrees  with  respect  to  the 
robot  Filler  points  in  the  boundary  are  marked  as  ones  and  obstacles  and  walls  are  marked 
as  sixes. 

Examples  of  Sonar  Errors 

Many  of  the  problems  with  the  sonar  sensor  that  were  described  in  the  previous 
chapter  show  up  dearly  in  plots  of  the  experimental  data.  Figure  4-1  depicts  how  doorways 
can  be  blurred  so  much  that  they  look  like  walls.  In  addition,  it  also  vividly  displays  the 
problem  of  the  sonar  measuring  the  shortest  distance  within  its  beam,  and  not  necessarily 
the  distance  alorg  the  beam’s  centerline,  as  shown  by  the  arc  of  data  points  along  the 
lefthand  wall.  Where  the  wall  is  perpendicular  to  the  beam,  the  distance  measured  is 
perfect,  but  to  either  side  of  that  point,  the  return  is  shorter  than  it  should  be. 

The  sonar  not  only  blurs  out  doors,  but  also  makes  small  thin  obstacles  look  very 
wide.  Figure  4-3  shows  a  pole  towards  the  lefihand  side  of  the  image  which  appears  to  be  as 
wide  as  the  sonar  beam.  Another  problem  occurs  when  the  sonar  is  pointed  towards  a 
comer  of  the  room.  Either  side  of  the  comer  is  closer  to  the  sensor,  so  an  underestimate  is 
always  returned.  Consequently,  the  real  comers  of  the  room  become  very  hard  to  locate. 
This  phenomenon  appears  in  the  upper  righthand  corner  of  Figure  4-4. 


Figure  4-3:A  Pole  Creates  a  Wide  Sonar  Image 
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A  good  example  of  ihc  specular  reflection  problem  is  portrayed  by  Figure  4-5.  Two 
I. -shaped  obstacles  were  placed  in  the  room  and  the  robot  is  about  a  fexat  and  a  half  from 
one  of  them.  Ihc  obstacle  really  was  a  folding  closet  door  that  had  a  series  of  horizontal 
sla.ts  that  were  angled  downward.  The  angle  of  these  surfaces  caused  the  beam  to  bounce 
tiff  in  some  other  direction  before  finally  being  detected  by  the  transducer,  giv  ing  a  distance 
reading  much  longer  than  was  really  the  case.  The  result  is  that  the  robot  is  blind  to 
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Figure  4*5:Spccular  Reflection  Caused  by  Angled  Surfaces 


portions  of  the  obstacle  right  in  front  of  it 

Infrared  Results 

Under  certain  conditions,  the  infrared  sensor  is  capable  of  locating  edges  of  doorways 
very  well.  It  does  this  by  noticing  changes  from  where  it  detects  something  to  where  it 
detects  nothing.  If  there  is  a  long  clear  distance  beyond  the  door  and  the  edge  of  the  door  is 
within  the  infrared's  range,  then  that  edge  can  be  located  with,  a  very  good  angular 
resolution. 

It  was  hoped  that  the  stepped  power  output  of  the  emitter  and  the  four  levels  of 
sensitivity  of  the  detector  would  enable  the  sensor  to  detect  less  drtistic  changes  in  depth, 
and  therefore  enable  the  sensor  to  discern  if  one  object  was  a  few  feet  in  fron  of  another.  It 
turned  out  however,  that  the  sensor  was  not  capable  of  such  performance  because  the 
vary  ing  reflectivities  of  surface  materials  in  the  room  precluded  any  attempt  at  correlating 
range  with  either  emitter  output  or  detector  sensitivity.  Nevertheless,  the  infrared  sensor 
could  consistently  pick  out  edges  of  doorways  that  were  completely  invisible  to  the,  sonar. 

Figure  4-6  shows  two  scenes  in  which  the  sonar  blurs  the  doors,  but  the  infrared  is 
able  to  pick  out  the  edges  very  accurately.  The  crosses  mark  where  the  infrared  detects 
corners,  or  specifically,  where  infrared  readings  of  30  transitioned  to  anything  else. 
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Readings  of  30  indicau  that  all  three  I  HDs  were  fired  and  no  echo  was  detected.  In  the  plot 
on  the  left,  the  sonar  is  able  to  see  a  passageway  through  the  dix;r  at  the  top  of  the  picture 
and  also  through  the  d'  Xir  on  the  left.  However,  the  sonar  smears  the  edges  of  the  leflhand 
doorway,  but  the  crosses  marked  by  the  infrared  make  it  obvious  that  the  infrared  was  able 
to  find  the  doorway  edges  where  they  really  were.  Similarly  for  the  plot  on  the  right,  the 
crosses  at  the  top  of  the  room 'show  that  the  infrared  correctly  finds  the  edges  of  the  dorrjam 
again,  while  the  sonar  is  completely  blind  to  them. 


Figure  4-6: Infrared  Sensor  Picks  Out  the  Doors 


However,  the  infrared  sensor  isn't  so  gixxi  at  finding  edges  of  obstacles  that  don't  have 
long  empty  region  of  space  behind  them.  Figure  4-7  shows  a  pole  blurred  by  the  sonar. 
I  he  sixes  in  the  array  to  the  left  mark  the  location  of  the  pole,  but  the  infrared  values 
nearby  don't  show  much  change.  Ihe  values  starting  with  the  point  (48  184)  and  following 
the  edge  of  data  points  dow  n  to  point  (47  197)  are  the  second  items  in  each  data  point  (17, 
13.  11.  13.  14.  14.  from  top  to  bottom).  This  is  because  the  wall  behind  the  pole  is  relatively 
close. 

The  problem  that  arises  in  trying  to  correlate  range  with  emitter  output  can  be  see  in 
f  igure  4  8  [he  points  (174  79)  and  (175  79)  show  their  infrared  values  swinging  from  31  to 
11.  yet  there  is  no  large  jump  in  distance.  The  edge  of  the  door  has  been  delected  between 
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Figure  4-7:Infrared  Can’t  Find  the  Pole 

the  30  and  31  transiiion.  the  points  (168  77)  and  (172  79). 

The  point  which  has  the  infrared  reading  of  11,  point  (176  79),  illustrates  the  crux  of 
the  problem.  It  is  actually  8.2  feet  away,  so  the  ranges  available  with  the  various  stepped 
outputs  of  the  emitter  overlap  quite  a  bit.  This  means  that  the  infrareds  can't  really  be  used 
to  signal  a  specular  icHection  caused  by  the  sonar  either.  Take  for  example.  Figure  4-9, 
which  clearly  shows  the  sonar  wrongly  stating  that  there  aren't  any  obstacles  for  seven  feet. 
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Figure  4-8:lnfrarcd  Ranging  Is  Hazy 

Fxen  lliough  the  infrared  readings  appear  to  detect  a  closc-by  object,  it  can't  overrule  the 
sonar  because  in  some  eases  one  I  .FI)  can  sec  out  to  8.2  feet  as  in  the  example  above. 

In  other  cases,  the  infrared  sensor  detects  edges  where  there  arc  none.  This  happens 
when  the  sensor  is  scanning  along  a  wall  amt  reaches  the  horizon  of  its  range.  It  suddenly 


notice s  a  change  from  seeing  something  to  seeing  nothing,  not  because  it  has  found  a 
doorw.r,  or  a  comer,  but  because  the  wall  is  going  in  such  a  direction  that  it  veers  out  of  the 
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Figure  4-9:Infrared  Can't  Adjust  For  Sonar's  Specular  Reflection 


limits  of  the  sensor.  The  crosses  in  Figure  4-10  show  where  the  infrared  just  starts  to  detect 
the  wall. 

There  arc  yet  other  problems.  Since  the  detector  is  focused  over  a  very  narrow  area, 
thin  objects  can  often  be  overlooked.  This  is  one  problem  the  sonar  has  no  trouble  with, 
however.  In  Figure  4-11.  the  infrared  fails  to  detect  the  pole  at  a  distance  of  8.2  feet,  yet  in 
Figure  4-12.  i'  dearly  finds  a  wall  at  10  feet.  The  parabolic  reflector  which  limits  the  area 
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Figure  4-IO:lnfrarcd  Reaches  the  Limits  of  Its  Range 


over  which  the  infrared  detects  echoes,  also  causes  it  to  miss  relatively  thin  obstacles  during 
a  scan. 

Rules  for  Fusing  Information 

Taking  into  account  all  the  limitations  listed  above,  the  following  rules  are  used  for 
fusing,  in  formation  from  the  two  sensors: 

*  Whenever  the  infrared  sensor  detects  a  30-to-other  discontinuity,  a  change  from 
detection  to  no  detection,  and  the  associated  sonar  reading  is  less  than  10  feet, 
then  it's  very  likely  that  a  valid  depth  discontinuity  has  been  detected. 

*  If  the  sonar  reading  is  greater  than  the  maximum  range  for  the  infrared,  then 
ignore  the  infrared. 

*  If  the  sonar  reading  is  at  its  maximum  value,  then  the  real  distance  is  greater. 

Using  these  few  rules  the  original  sonar  boundary  can  be  redrawn  to  take  into  account 
any  passageways  found  by  the  infrared  sensor.  This  is  done  by  finding  a  pair  of  edges  and 
redrawing  the  boundary  in  between  to  be  an  arc  at  the  maximum  of  cither  the  horizon  of 
the  infrared  or  the  furthest  sonar  reading  returned  between  the  two  edges  of  the 
passageway.  Figure  4-13  shows  an  original  boundary  and  the  redrawn  niap  based  on 
information  from  both  sensors.  The  original  boundary  is  shown  in  the  right  hand  picture 
with  the  new  boundary  overlaid  on  it  at  the  locations  where  the  infrared  marked  the  edges. 
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Figure  4-1  l:The  Infrared  Misses  the  Pole  at  8.2  Feet 

The  infrared  edges  are  marked  by  the  crosses,  and  those  are  the  positions  at  which  the 
infrared  noticed  a  transition  from  detecting  something  tc  detecting  nothing  (and  vice  versa 
for  the  other  edge).  The  left  hand  figure  shows  the  modified  boundary  after  combining  the 
infrared  data  with  the  sonar.  It's  clear  that  the  doors  are  more  pronounced  after  filtering 
with  the  infrared. 

Due  to  the  problems  mentioned  earlier  about  the  infrared  falsely  detecting  doors, 
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Figure  4-12:The  Wall  is  Detected  at  10  Feet 


because  it  has  reached  the  limits  of  its  range,  sometimes  the  boundaries  are  redrawn 
incorrectly  as  in  Figure  4-14.  The  real  distance  to  the  wall  is  beyond  the  horizon  of  the 
infrared,  but  the  sonar  returns  a  foreshortened  reading  because  it's  aimed  towards  a  comer. 
Consequently,  the  infrared  data  is  not  ignored,  and  the  robot  thinks  it  has  found  another 
door.  However,  after  a  sequence  of  moves  the  robot  will  see  the  same  walls  from  difTercnt 
vantage  points. and  these  false  doors  can  be  dismissed.  This  modified  boundary  is  then 
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Figure  4*13:Rcdrawn  Map  From  Combining  Sensors 


Figure  4-14:Infrared  Detects  False  Door  in  the  Comer 


input  to  the  next  stage  of  processing  which  builds  a  representation  of  the  room  better  suited 
fora  path  planner. 


Chapter  Five 

Building  a  Representation 


After  combining  data  from  two  sensors  to  create  a  refined  view  of  the  room,  this  new 
map  needs  to  be  converted  to  a  representation  suitable  for  planning  intelligent  tasks,  such  as 
navigating  through  a  workspace.  This  is  achieved  by  first  transforming  the  refined  data  to 
an  intermediate  representation,  the  curvature  primal  sketch  [Brady  84],  which  is  convenient 
for  merging  separate  views  between  robot  moves.  The  curvature  primal  sketch 
representation  can  then  be  easily  converted  into  a  polygonal  representation  of  the  world 
suitable  for  path  planners  [Brooks  83,  Brooks  85]. 

The  curvature  primal  sketch  describes  a  boundary  by  a  spline  whose  knot  points  are 
significant  changes  in  curvature.  These  knot  points  are  located  by  taking  first  and  second 
derivatives  of  Gaussians  at  various  scales  which  have  been  convolved  with  the  boundary, 
and  then  looking  for  patterns  of  zero  crossings,  maxima  and  minima  of  the  smoothed  lesult 
that  correspond  to  models  of  curvature  change  such  as  those  produced  by  comers,  ends  and 
smooth  joins.  Figure  5-1  shows  an  example  of  the  curvature  primal  sketch  at  two  scales 
along  with  the  refined  boundary.  Instead  of  fitting  a  spline  to  these  knot  points,  however, 
straight  line  approximations  are  used.  Straight  line  approximations  are  appropriate  for  a 
robot’s  world  in  which  walls,  desks,  tables  and  other  obstacles  are  most  often  composed  of 
straight  edges. 

There  are  some  nice  features  of  the  curvature  primal  sketch  which  make  it  a  desirable 
representation  for  this  application,  the  task  of  building  a  room  map  out  of  several  views 
from  different  positions  as  the  robot  goes  exploring.  First,  the  knot  points  are  convenient 
for  matching  subsequent  scans  because  they  have  local  support  That  is,  the  curvature 
primal  sketch  isn’t  based  on  any  global  properties  of  the  data  plot  of  the  room,  such  as 
length,  width,  or  aspect  ratio.  Rather,  the  knot  points  are  determined  only  by  their 
relationship  to  their  neighbors.  This  is  important  when  trying  to  merge  several  views  of  the 


Figure  5*l:The  Curvature  Primal  Sketch  of  the  Refined  Boundary 

room,  because  from  different  viewpoints  the  global  shape  of  the  room  can  change 
drastically.  Some  objects  can  become  occluded  while  others  become  uncovered.  The  knot 
points  in  the  curvaure  primal  sketch,  however,  will  stay  close  to  the  same  location  for 
incremental  changes  in  the  robot  position.  Figure  5-2  shows  the  curvature  primal  sketch  of 
the  robot's  view  after  it  has  moved  two  feet  to  the  right  with  respect  to  Figure  5-1.  From  its 
new  position,  the  robot  is  unable  to  see  the  same  areas  behind  the  circular  object  as  from  its 
earlier  position.  Figure  5-3  shows  the  robot's  perspective  after  it  moves  two  feet  forward 
from  its  position  in  Figure  5-2.  Previously  occluded  areas  can  be  seen  to  become 
uncovered.  However,  edges  that  were  visible  from  both  perspectives  have  similar  knot 
points. 

Second,  the  knot  points  are  found  reliably  and  consistently  due  to  processing  at 
multiple  scales.  Gaussian  filters  with  a  large  base  of  support  smooth  out  noise  and  detect 
occurences  of  curvature  changes,  while  gaussians  with  small  support  can  then  be  used  to 
localize  those  occurences.  This  is  analogous  to  previous  work  on  finding  edges  in  images 
[Canny  83]. 
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more  suitable  for  planners.  Path  planning  programs  typically  expect  a  list  of  polygons  as 
input  for  their  simulated  worlds.  The  straight  line  segments  between  knot  points  can  easily 
be  converted  to  very  thin  rectangles,  translated  and  rotated  appropriately., 


Figure  5-4:Polygonal  Representation  for  a  Planner 


This  conversion  is  shown  in  Figure  5-4  where  the  plot  of  Figure  5-2  is  represented  as  a 
set  of  linked  rectangles  of  width  one.  ready  for  input  to  a  planner  based  on  generalized 
cones  (Brooks  83].  This  planner  finds  freeways  and  channels  in  the  room  by  carving  up  the 
freespace  into  generalized  cones.  Generalized  cones  have  a  spine,  and  are  parameterized  by 
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some  sweeping  rule  which  says  how  the  width  to  either  side  of  the  cone  varies.  All  pairs  of 
edges  of  the  polygons  in  the  workspace  are  compared  and  heuristics  are  used  to  prune  the 
number  of  cones  generated.  The  spines  of  the  resulting  cones  arc  illustrated  in  Figure  5-5. 


Figure  5-5:Spincs  of  Generalized  Cones  Representing  the  Freespace 


Paths  from  start  to  goal  arc  determined  by  searching  the  spines  of  the  cones  for  the 
optimal  route.  The  robot  translates  along  the  length  of  the  spines  and  rotates  at  the 
intersection  of  two  spines.  The  path  found  from  specified  start  and  goal  locations  is  shown 
in  figure, 5-6.  This  path,  slightly  mere  complicated  than  need  be,  shows  that  this  planner 


doesn't  work  well  with  lots  of  obstacles.  This  is  also  clear  from  the  number  of  cones 
generated  in  Figure  5-5. 


several  views  from  different  perspectives,  the  robot  has  to  be  intelligent  about  what  new 
pieces  of  data  to  include  in  his  global  model  and  what  old  pieces  to  throw  away.  Basically,  it 
must  match  what  he  can  and  make  decisions  about  what  to  do  with  the  rest 


Figure  5-7:Three  Views  Overlaid  Two  at  a  Time 

l  itis  problem  is  illustrated  in  Figure  5-7.  The  earlier  examples  of  three  di  fferent  views 
of  the  room  arc  translated  and  scaled  appropriately,  then  overlaid  two  at  a  time  so  as  not  to 
become  too  cluttered.  It's  clear  that  some  edges,  such  as  the  wall  to  the  right,  tend  to  match 
up  between  views  but  that  other  edges,  such  as  those  behind  the  circular  obstacle,  tend  to 
change.  From  the  fu  st  position,  the  robot  secs  an  area  behind  and  to  the  left  of  the  circular 
obstacle,  f'lom  the  second  position,  that  area  becomes  occluded.  After  the  robot  moves 
forward,  however,  it  again  secs  the  same  area  behind  the  obstacle,  only  now  it  secs  it  from 
the  left  side.  The  robot  should  infer  that  these  areas  tire  connected  and  that  a  small  object 
lies  in  the  middle  of  the  room.  The  desired  output  of  a  program  that  was  intelligent  about 
how  to  do  such  a  merge  might  look  something  like  Figure  5-8. 

Some  ideas  on  solving  this  problem  (Chatila  85]  were  mentioned  earlier  in  the  section 
on  flilare.  In  that  work,  the  robot  scans  the  room  from  one  perspective  and  assigns  local 
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Figure  5-8:Dcsired  Room  Map  After  Merging  Views 

coordinate  frames  to  clusters  of  data  points  that  are  line-fitted.  Edges  adjoining  these  local 
frames  are  marked  as  fake  edges.  By  knowing  which  edges  arc  fake  and  which  are  real,  the 
robot  can  hypothesize  about  what  it  might  see  after  its  next  move.  That  is.  it  can  assume 
that  new  objects  might  become  uncovered  through  these  fake  edges.  It  can  also  predict 
what  objects  might  become  occluded.  Then  after  the  next  move  and  Scan,  it  matches  new 
edges  to  appropriate  local  coordinate  frames  of  prev  iously  discovered  objects.  Having  edges 
referenced  to  local  coordinate  frames  reduces  uncertainty  in  the  building  of  the  final  map. 
For  edges  that  can't  be  matched,  decisions  are  made  based  upon  whether  or  not  the  new 
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information  coincides  with  what  w;ls  predicted. 


Some  of  these  ideas,  modified  slightly,  can  be  incorporated  into  our  problem  of 
merging  sonar  maps.  One  big  difference  lies  in  the  problem  of  deciding  what  to  match.  In 
the  Hilare  work,  they  assume  they'll  have  laser  rangefinder  data  which  is  much  more 
accurate  than  sonar  data.  Consequently,  the  local  coordinate  frames  can't  be  built  here 
because  the  sonar  blurs  corners  that  come  out  towards  the  robot  However,  the  important 
idea  of  matching  landmarks  that  have  local  support  can  be  retained,  because  of  the 
availability  of  the  knot  points  from  the  curvature  primal  sketch.  These  knot  points  mark 
features  in  the  scan  that  depend  only  on  characteristics  of  neighboring  points,  namely 
curvature  changes. 

Furthermore,  Jie  fake  edges  described  in  the  Hilare  work  are  already  included  in  the 
curvature  primal  sketch  representation.  These  edges  are  made  up  of  the  filler  points 
mentioned  earlier,  when  the  connected  boundary  was  created  from  the  sparse  sonar  data. 
The  connected  boundary  was  created  from  the  raw  data  in  order  to  make  the  curvature 
primal  sketch  axle  run,  but  explicitly  keeping  this  information  about  filler  points  can  be 
useful  for  marking  fake  edges  between  knot  points.  Knowledge  about  which  edges  between 
knot  points  are  fake  and  which  arc  real  is  useful  for  hypothesizing  where  obstacles  can 
become  occluded  or  uncovered. 

After  deciding  what  are  close  matches  and  what  can  be  thrown  away  or  added, 
algorithms  appropriately  unioning  and  intersecting  the  appropriate  polygons  can  be  taken 
from  work  in  computational  geometry  [Weiler  77],  Basically,  a  union  of  the  plots  shown  in 
Figure  5-7  would  produce  the  desired  output  of  Figure  5-8.  The  difficulty  lies  in  making 
decisions  about  what  constitutes  a  match  and  about  what  polygons  to  intersect  or  union 
when  there  isn’t  a  match. 

After  a  global  room  map  is  created,  it  can  then  be  used  as  a  model  of  the  robot’s  entire 
workspace  upon  which  it  can  plan  tasks.  An  example  of  another  planner,  based  on 
configuration  space  [Brooks  85],  is  shown  in  Figure  5-9.  Configuration  Space  is  a 
representation  of  the  workspace  in  which  the  robot  is  shrunk  to  a  point  and  the  obstacles  are 
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Figure  5-9:Confip,uration  Space  Planner 


grown  appropriately.  Growing  an  obstacle  by  translating  the  robot  to  each  of  its  vertices 
produces  a  two  dimensional  configuration  space,  and  the  problem  of  finding  a  free  path  for 
a  polygonal  robot  through  polygonal  obstacles  reduces  to  the  problem  of  finding  a  path  for 
a  point  through  these  grown  polygons.  However,  allowing  the  robot  to  rotate,  creates  a 
three  dimensional  configuration  space  where  the  obstacles'  surfaces  can  be  curved,  so  the 
problem  of  finding  the  path  of  a  point  through  this  space  becomes  much  harder.  The 
solution  this  planner  uses  is  to  slice  up  the  three  dimensional  configuration  space  and  look 


for  subpaths  that  involve  only  translation.  It  finds  a  path  as  far  as  it  can.  and  then  tries  a 
new  orientation  in  a  different  slice  of  the  configuration  space.  This  planner  is  guaranteed  to 
find  a  path  is  one  exists.  A  harder  problem  given  to  this  planner  is  shown  in  Figure  5*10. 


Figure  5*10:Hardcr  Problem  For  the  Planner 


For  a  three  dimensional  world,  configuration  space  gets  very  complicated.  Allowing 
for  three  degrees  of  rotations  produces  a  six  dimensional  configuration  space.  However,  for 
mobile  robots  dial  don't  fly.  and  for  a  two  dimensional  model  of  the  world,  a  three 
dimensional  implementation  of  configuration  space  works  quite  well. 
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Chapter  Six 
Conclusion 


Two  inexpensive  sensors.a  Polaroid  sonar  transducer  and  a  novel  infrared  sensor  have 
been  combined  to  produce  a  refined  map  of  the  robot's  workplace  which  is  suitable  for  use 
by  intelligent  path  planning  programs. 

The  sensors  were  modeled,  lots  of  experimental  data  was  analyzed,  and  rules  were 
developed  for  combining  their  information.  The  sonar  had  the  effect  of  blurring  both 
obstacles  and  passageways,  although  was  quite  accurate  in  providing  distance  information  to 
the  nearest  object.  The  infrared  sensor,  a  device  with  stepped  power  output  and  varying 
levels  of  detector  sensitivity,  although  unable  to  provide  any  accurate  distance 
measurement,  was  able  to  detect  the  absence  or  presence  of  an  obstacle  with  very  good 
angular  resolution.  It  could  very  reliably  pick  out  edges  of  doorways  that  were  invisible  to 
the  sonar.  However,  it  didn't  work  as  well  as  was  hoped  in  finding  edges  of  obstacles  when 
other  objects  lay  just  a  few  feet  behind  the  first.  This  was  due  to  the  fact  that  the  infrared 
sensor's  ranging  capability  was  far  too  coarse.  Rules  were  developed  specifying  how  sensory 
data  should  be  combined  and  the  raw  data  was  then  refined  into  a  map  that  was  more 
accurate  than  if  either  sensor  was  used  alone. 

This  refined  map  was  converted  into  an  intermediate  representation,  the  curvature 
primal  sketch,  which  represented  the  boundary  with  knot  points  that  marked  significant 
changes  in  curvature  of  the  smoothed  boundary.  The  original  contour  was  then 
approximated  by  connecting  these  knot  points  with  straight  line  segments.  From  this 
representation,  it  was  a  simple  step  to  convert  to  a  representation  expected  by  path  planning 
programs,  namely,  a  list  of  polygons.  Each  straight  line  segment  in  the  boundary  was 
converted  to  a  list  of  very  thin  linked  polygons  that  were  translated  and  rotated 
appropriately.  Examples  of  such  a  representation  used  with  a  planner  based  on  generalized 
cones  was  given 
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An  even  more  important  reason  for  choosing  the  curvature  primal  sketch  as  an 
intermediate  representation  was  that  it  had  certain  features  which  made  it  amenable  to 
creating  a  global  room  map  by  merging  views  of  the  room  taken  from  different  perspectives. 
Because  the  calculation  of  the  knot  points  was  based  on  local  information.  Lie  curvature  of 
neighboring  points,  they  represented  landmarks  that  were  invariant  from  move  to  move, 
provided  the  moves  were  small.  Therefor*.,  these  knot  points  could  be  used  to  match  and 
track  similarities  between  data  scans  taken  on  subsequent  moves.  A  method  for  taking 
several  plots  and  merging  them  into  a  global  map  was  discussed.  An  example  of  another 
planner,  one  based  on  configuration  space,  was  shown  for  such  a  map.  Consequently,  two 
examples  of  intelligent  programs,  path  planners,  were  shown,  which  were  run  not  or. 
simulated  data,  but  on  real  data  produced  from  very  cheap  sensors. 
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